Supercritical general branching processes 
conditioned on extinction are subcritical 



Peter Jagers** Andreas N. Lageras^ 
February 2, 2008 

Abstract 

It is well known that a simple, supercritical Bienayme-Galton- Watson 
process turns into a subcritical such process, if conditioned to die out. We 
prove that the corresponding holds true for general, multi-type branch- 
ing, where child-bearing may occur at different ages, life span may depend 
upon reproduction, and the whole course of events is thus affected by con- 
ditioning upon extinction. 
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1 Introduction 

The theory of branching processes was born out of Galton's famous family ex- 
tinction problem. Later, interest turned to populations not dying out and their 
growth and stabilisation. In more recent years, extinction has retaken a place 
in the foreground, for reasons from both conservation and evolutionary biology. 
The time and path to extinction of subcritical general populations was studied in 
pj]. Here, time structure is crucial, and life spans and varying bearing ages can- 
not be condensed into simple, generation counting Bienayme-Galton- Watson 
processes. Thus, the question arises whether (non-critical) general branching 
populations bound for extinction must behave like subcritical populations. 

We answer this in the affirmative: a general, multitype branching process 
conditioned to die out, remains a branching process, but one almost surely dying 
out. If the original process was supercritical but with a strictly positive risk of 
extinction, the new process is subcritical. 

Formulated in such a loose manner, this fact belongs to the folklore of 
branching, but actually it has been proved only for Bienayme-Galton- Watson 
processes, [I], p. 52. A moment's afterthought tells us that it remains true 
for age-dependent branching processes of the Bellman-Harris type, where indi- 
viduals have i.i.d. life spans, and split into i.i.d. random numbers of children, 
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independently of life span, the time structure thus not being affected by the 
conditioning. 

But what if the flow of time is no longer independent of reproduction? Even 
the simplest case, that of a splitting reproduction at death, but not indepen- 
dently of age at death/splitting, would seem to offer difficulties, and the same 
certainly applies to the more realistic general processes where reproduction oc- 
curs as a point process during life, thus mimicking the yearly births of wildlife, 
or the even more erratic reproduction pattern of humans. 

The conceptual framework is intuitive. Starting from an Eve, individuals live 
and give birth independently of one another. At birth each individual inherits a 
type from her mother. The type, in its turn determines the probability measure 
over all possible life careers, including a life span and a marked point process 
which reports the successive ages at bearing, and the types of children at the 
various bearings. Note that multiple births are not excluded. The branching 
property can be summarised into the fact that given her type and birth time, 
the daughter process of any individual born is independent of all individuals not 
in her progeny (into which she herself is included). 

We set out to prove that this branching property remains true for processes 
conditioned to die out. Initially, we shall not mention supercriticality, and only 
ask that the probability of extinction is non-zero for any starting type. (If 
that probability is one, the conditioning does not change anything!) Largely, 
the proof is a matter of conceptual clarity or discipline, which unfortunately 
forces us into the somewhat burdensome notation of probabilities on tree spaces, 
obscuring the essential simplicity of the matter. 

The main idea behind the proof is, however, easily outlined. Indeed, con- 
sider an individual, and condition upon her progeny ultimately dying out. Her 
own life career is then affected precisely through her only being able to have 
daughters whose progeny in their turn must ultimately face extinction. In all 
other respects her life is independent of all others, once her type is given. This 
reestablishes the branching character, but with a suitably amended probability 
measure over her life career, which clearly is non-supercritical in the sense that 
the probability of ultimate extinction is one, from any starting type that can be 
realised. 

If the original process is, furthermore, supercritical, i.e. has a positive 
Malthusian parameter, the conditioned process will turn out to be subcritical, 
in the sense of the Malthusian parameter being negative, if it exists. 

2 Notation 

2.1 The Ulam-Harris family space 

We choose to work within the classical Ulam-Harris framework, 0] , identifying 
individuals with sequences of natural numbers so that x = (x%,X2, ■ ■ • , x n ) de- 
notes the x n th child of the ... of the X2th child of the xith child of the ancestor. 
The ancestor is denoted by an "empty" sequence e (mnemonic for "empty" or 
"Eve" ) , and the set of all possible individuals is 
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The concatenation of x, y G T is xy, and thus ex = xe — x for all i£l 

For any e ^ x = (xi, x%, . . . , x n ) x's mother is mx = (x\, . . . , x n -i), her 
rank in the sibship and x's generation g(x) = n. We agree that 

me = re — e and g(e) = 0. Hence, mxrx — x for x G T, and m can be iterated 
so that m n x is x's nth grandmother, provided g{x) > n. 

Clearly x stems from ?/, usually written x >z y, if m n x — y for some n G 
N U {0}, or equivalently if there exists az£T:i = yz. In this terminology, x 
stems from herself, x < x. In other words, (T, ■<) is a partially ordered set (a 
semilattice). We define x ~ y if x > y or x -< y, i.e. x and y are in direct line of 
descent. (~ is not an equivalence relation.) 

For A, B C T, x G T, we write x >r A if there exists a y £ A such that 
x h y, and A ^ B if a; >r A for all x G B. The progeny of A C T is defined as 
PrA = {x G T : x b 

We call a set L C T a stopping line, or line for short, if no two members of 
L are in direct line of descent: x, y £ £, x ^ y => x 9^ y. We say that a line L is 
a covering line if for all x G T there exists a, y £ L such that x ~ y. 

2.2 Life space and population space 

Let (f2^ , ^) be a Zi/e space so that cj G f2f is a possible life career of individuals. 
Any individual property, such as mass at a certain age or life span, is viewed 
as a measurable function (with respect to the a-algebra s^t) on the life space. 
This should be rich enough to support, at least, the functions r(k),a(k) for 
fc G N. Here r(fc) : fi^ — > K + U {00} is the mother's age at the fcth child's birth, 
< t(1) < r(2) < ■ ■ ■ < 00. If r(fc) = 00, then the fcth child is never born. 
er(fc) : Sl^ — ^ *S is the child's type, obtained at birth. The type space S has 
a (countably generated) er-algebra 5? ' . The whole reproduction process is then 
the marked point process £ with £(A x B) = ^{fc : cr(fc) G A, r(k) G B} for 
A e y,B e the Borel algebra on R + . 

The population space is defined as (fi, &f) = (S X ftj,y x £^ T ). J7m is 
the projection — » Sl* f , for M C T. For simplicity U x = U^ x y and similarly 
Prx = Pr{x}. The following a-algebras are important: 

& L = y x a(U x :x^L) = yx a(U x : x ^ PrL), 

for L G T. Since L < M ^- PrL D PrM => C J^/, it holds that 
(J?l : i G T) is a filtration under <. In the usual manner, the definition of the 
CT-algebras J^z, can be extended to er-algebras of events preceding random lines 
C which are optional in the sense that events {C ^ L} G Tl [4]. 

Functions £, r(fc) and cr(fc) were defined on the life space but we want to 
be able to speak about these quantities pertaining to a given x G T. We write 
6r = £ U x , x's reproduction process, t x — r(rx) o U mx , x's mother's age at x's 
birth, and x's type a x = a(rx) o U mx . Note the difference between r(fc) and rfc, 
cr(fc) and CTfc, for fc G N C T. 

Finally, the process is anchored in real time by taking Eve to be born at 
time 0, and later birth times t x ,x G T recursively determined by t e = and 
tx = t mx + t x for e 7^ x G T. The meaning of t x = 00 is that x is never 
born, so that — {x G T : t x < 00} is the set of realised individuals. This 
set is optional, is well defined [4], and so is the a-algebra To#, of events 

pertaining only to realised individuals. The probability space restricted to such 
events is that where a branching processes really lives, cf. jS], [2J. 
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2.3 The probability measure and branching property 



The setup is that for each s£5 there is a probability measure P(s, •) on the life 
space (Sle,£/e), such that the function s — > P(s, A) is measurable with A 6 sfi. 
For any s£5 this kernel (the life kernel) defines a population probability measure 
P s on (Q, sf) with an ancestor of type o~ e = s and such that given a x , x's life 
will follow the law P(cr x , •) independently of the rest of the process, see [I]. 

Indeed, the basic branching property of the whole process can be charac- 
terised by a generalisation of this in terms of the mappings S x — (o~ x , Up rx ) : 
S x T — > S x T, which renders x the new Eve. Let T x = S 1 " 1 and {A x : x £ 
L} C sf. Then, 



xEL 



for lines L. This remains true for optional lines and in particular 

pi T X A X F Ln3e )= n (i) 



where the intersection over the empty set is taken to be and the empty product 
is ascribed the value one. The interpretation is that the daughter processes of 
all realised individuals x in a line are independent given the prehistory of the 
line with the population probability measure P<j x , the only dependence upon 
the past thus being channelled through the type o~ x and the placing in time t x . 
This is the branching property. We shall see that it remains true for processes, 
bound to die out. 



3 Conditioning on extinction 

Denote by E the event that the branching process starting from Eve dies out, 
i.e. that has only a finite number of elements. Let q s = F S (E) and E x be the 
event that the branching process starting from x dies out, E x — T X E. Write 
P s (-) — P s (-|£'), which clearly only makes sense for s £ S such that q s > 0, and 
let E s denote expectation with respect to P s . 

Theorem 1. Any branching process conditioned on extinction remains a branch- 
ing process, but with extinction probability one. Its life kernel is P(s, A) :— 
P S («S x A x ttj^^) for A G sff. Thus, for any covering lines L and {A x : x £ 
L} C sf 

f| T x A x T Ln ^\= J] ^.(Ar). (2) 
■-*<-«& ' xeLnSt 
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Furthermore, the Radon-Nikodym derivative dP s /d¥ s with respect to the o~- 
bra J-'LnSi *s given by 



i n 



Proof. First, note that 

E= f| T X E, 
x£Ln& 



(3) 
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for covering lines L. Indeed, since {L n M — 0} C E and intersection over an 
empty index set yields the full space, 

E = (£n{in* = t)})u(£n{Lnif ^«}) 

{in« = 0}n p| t x e j u ({ L n st + 0} n p| r^J 



The branching property |T]) implies that 

F S (E\T L )= H qa m =¥,(E\T LnM ). 

Hence, for any covering line L and A 6 J-i, 



(4) 



P S (A) =E S 



and thus ([3|) holds. Equations |[TJ and Q yield 



and ([2]) follows. 

Remark. For L = N, the first generation, 



□ 



dP, 



7 n 

ys k&i:k<X 



where X := £ e (S x K + ) is Eve's total offspring. In single-type processes with 
extinction probability q therefore 



dP 



(5) 



and in the Bienayme-Galton- Watson case 

P(X = k) = W L q x - 1 ;X = k] = F(X = k)q k - 1 , 
in perfect agreement with [I] Theorem 1.12.3]. 

Example. Consider single-type Sevastyanov splitting processes, where individ- 
uals have a life span distribution G and at death split into k particles with 
probability pk(u), if the life span L = u. By ([5|) we conclude that 



Hence 



(X = k,L e dw) =E[q A - 1 ; X = fc,L G dw] = p fc (u)<r -1 G(diO 



G(du) = P(L e dtt) = ^p fe (w)9 fe - 1 G ! (dt 
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and F(X = k,L 6 du) = p k (u)G(du), with 



Pfc(u) 



Pk(u)q k 1 



and the conditioned Sevastyanov process remains a Sevastyanov process with 
life span distribution G and splitting probabilities pk (u) . 

Finally, we address the question of whether a supercritical process condi- 
tioned on extinction is subcritical. It is clear that the conditioned branching 
process has extinction probability one, for any starting type, but this would also 
be the case if the process were nontrivially critical. 

In the single type case it follows from ([5]) that the expected total number of 
children per individual in the conditioned process satisfies 

E[X]=E[Xq x - 1 } =f'(q), 

in terms of the offspring generating function / of the embedded Bienayme- 
Galton- Watson process. It is well known that f'(q) < 1 if the original process 
was supercritical, see [T], and we conclude that the conditioned process is sub- 
critical. 

Also in the general case it is enough to consider the embedded generation 
counting process. If X n (A) = cardja; G N™ PI M : a x G A} denotes the number 
of individuals in nth generation of type A and q := sup s q s < 1, 



i. [*«(£)] 



X n (S)eJs 



S s \ogq r X n (dr) 



/q s < K s 



X n (S)q X ^ 



/q s -» 0, 



as n — » oo (by dominated convergence, since X n (S) must either tend to zero 
or to infinity) . But the expected size of the embedded process tending to zero 
means exactly that the process is subcritical. 
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